Analytical Mean Squared Error Curves in Temporal Difference Learning

Authors

  • Satinder Singh
  • Peter Dayan
Abstract

We have calculated analytical expressions for how the bias and variance of the estimators provided by various temporal difference value estimation algorithms change with offline updates over trials in absorbing Markov chains using lookup table representations. We illustrate classes of learning curve behavior in various chains, and show the manner in which TD is sensitive to the choice of its step-size and eligibility trace parameters.
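The setting described in the abstract can be explored empirically with a small simulation. The sketch below is not the authors' analytical method: it estimates a mean squared error learning curve by Monte Carlo averaging over runs of offline TD(λ) with a lookup table. The five-state random-walk chain, the reward of 1 on right-hand absorption, the initial values of 0.5, and the names `run_trial`, `offline_td_lambda_update`, and `mse_curve` are all assumptions chosen for illustration.

```python
import random


def run_trial(rng, n_states):
    """Sample one trial of an assumed symmetric random walk.

    Returns a list of (state, reward, next_state) steps; next_state is
    None when the chain is absorbed. Reward is 1.0 only on absorption
    at the right-hand end (an illustrative choice).
    """
    state = n_states // 2
    steps = []
    while True:
        nxt = state + (1 if rng.random() < 0.5 else -1)
        if nxt < 0:
            steps.append((state, 0.0, None))
            return steps
        if nxt >= n_states:
            steps.append((state, 1.0, None))
            return steps
        steps.append((state, 0.0, nxt))
        state = nxt


def offline_td_lambda_update(values, steps, alpha, lam):
    """Apply one offline TD(lambda) update (undiscounted).

    TD errors are computed from the values held fixed during the trial;
    the accumulated increments are added only after the trial ends.
    """
    n = len(values)
    trace = [0.0] * n          # accumulating eligibility traces
    increment = [0.0] * n
    for state, reward, nxt in steps:
        trace = [lam * e for e in trace]
        trace[state] += 1.0
        v_next = 0.0 if nxt is None else values[nxt]
        td_error = reward + v_next - values[state]
        for i in range(n):
            increment[i] += alpha * td_error * trace[i]
    return [v + d for v, d in zip(values, increment)]


def mse_curve(alpha, lam, n_states=5, n_trials=50, n_runs=200, seed=0):
    """Estimate MSE (averaged over states and runs) after each trial."""
    rng = random.Random(seed)
    # True values for this chain: probability of right-hand absorption.
    true_values = [(i + 1) / (n_states + 1) for i in range(n_states)]
    curve = [0.0] * (n_trials + 1)
    for _ in range(n_runs):
        values = [0.5] * n_states
        for t in range(n_trials + 1):
            sq_err = sum((v - tv) ** 2 for v, tv in zip(values, true_values))
            curve[t] += sq_err / n_states
            if t < n_trials:
                steps = run_trial(rng, n_states)
                values = offline_td_lambda_update(values, steps, alpha, lam)
    return [c / n_runs for c in curve]


if __name__ == "__main__":
    for lam in (0.0, 0.5, 1.0):
        curve = mse_curve(alpha=0.05, lam=lam)
        print(f"lambda={lam}: start={curve[0]:.4f}, end={curve[-1]:.4f}")
```

Calling `mse_curve` with different `alpha` and `lam` values gives a rough view of how the learning curve depends on the step-size and eligibility trace parameters, which is the sensitivity the abstract refers to. Sampling noise limits the precision of such curves, whereas the paper reports analytical expressions.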


Similar resources

Analytical Mean Squared Error Curves in Temporal Difference Learning

Peter Dayan, Brain and Cognitive Sciences, E25-210, MIT, Cambridge, MA 02139. We have calculated analytical expressions for how the bias and variance of the estimators provided by various temporal difference value estimation algorithms change with offline updates over trials in absorbing Markov chains using lookup table representations. We illustrate classes of learning curve...


Analytical Mean Squared Error Curves in Temporal Difference Learning

We have calculated analytical expressions for how the bias and variance of the estimators provided by various temporal difference value estimation algorithms change with offline updates over trials in absorbing Markov chains using lookup table representations. We illustrate classes of learning curve behavior in various chains, and show the manner in which TD is sensitive to the choice of its steps...


Evaluation of remote sensing indicators in drought monitoring using machine learning algorithms (Case study: Marivan city)

Remote sensing indices are used to analyze the spatio-temporal distribution of drought conditions and to identify the severity of drought. This study uses various drought indices generated from MODIS and TRMM satellite data extracted from the Google Earth Engine (GEE) platform. Drought conditions in Marivan city from February to November for the years 2001 to 2017 were analyzed based on spatial a...


Using Machine Learning ARIMA to Predict the Price of Cryptocurrencies

The increasing volatility in pricing and the growing potential for profit in digital currency have made predicting the price of cryptocurrency a very attractive research topic. Several studies have already been conducted using various machine-learning models to predict cryptocurrency prices. The study presented in this paper applied a classic Autoregressive Integrated Moving Average (ARIMA) model ...


Ordering Points for Incremental TIN Construction from DEMs

The standard method of building compact triangulated surface approximations to terrain surfaces (TINs) from dense digital elevation models (DEMs) adds points to an initial sparse triangulation or removes points from a dense initial mesh. Typically, in each triangle in the current TIN, the worst-fitting point, in terms of vertical distance, is selected. The order of insertion of the points is deter...




Publication date: 1988